Computational Baby Learning
نویسندگان
چکیده
A baby inherently possesses the capability of recognizing a new visual concept (e.g., chair, dog) by learning from only very few positive instances taught by parent(s) or others, and this recognition capability can then be gradually further improved by exploring and/or interacting with the real instances in the physical world. In this work, we aim to build a computational model to interpret and mimic this baby learning process, based on prior knowledge modelling, exemplar learning, and learning with video contexts. The prior knowledge of a baby, inherited through genes or accumulated from life experience, is modelled with a pre-trained Convolutional Neural Network (CNN), and the convolution layers form the knowledge base of the baby brain. When very few instances of a new concept are taught, an initial concept detector is built by exemplar learning over the deep features from the pre-trained CNN. Furthermore, when the baby explores the physical world, once a positive instance is detected/identified with high score, the baby shall further observe/track the variable instance possibly from different view-angles and/or different distances, and thus more instances are accumulated. We mimic this process by the massive online unlabeled videos and well-designed tracking solution. Then the concept detector can be fine-tuned based on these new instances. This process can be repeated again and again till the baby has a very mature concept detector in the brain. Extensive experiments on Pascal VOC-07/10/12 object detection datasets [8] well demonstrate the effectiveness of the proposed computational baby learning framework. It can beat the state-of-the-art full-training based performances by learning from only two positive instances for each object category, along with 20,000 videos which mimic the baby exploration of the physical world.
منابع مشابه
Baby Cry Sound Detection: A Comparison of Hand Crafted Features and Deep Learning Approach
Baby cry sound detection allows parents to be automatically alerted when their baby is crying. Current solutions in home environment ask for a client-server architecture where an end-node device streams the audio to a centralized server in charge of the detection. Even providing the best performances, these solutions raise power consumption and privacy issues. For these reasons, interest has re...
متن کاملThe Role of the Trainer in Reinforcement Learning
In this paper we propose a three-stage incremental approach to the development of autonomous agents. We discuss some issues about the characteristics which differentiate reinforcement programs (RPs), and define the trainer as a particular kind of RP. We present a set of results obtained running experiments with a trainer which provides guidance to the AutonoMouse, our mouse-sized autonomous rob...
متن کاملOn the interplay between “ learning , memory , prospection and abstraction ” in cumulatively learning baby humanoids
We all ‘inhabit’ continuously changing unstructured worlds where neither everything can be known nor can everything be experienced. The lifelong interplay between neural mechanisms associated with learning, memory, prospection and abstraction play a fundamental role in enabling cognitive agents to effortlessly connect their ‘past’ with the ‘available present’ and ‘possible future’, most often i...
متن کاملSituated Nonmonotonic Temporal Reasoning with BABY-SIT
After a review of situation theory and previous attempts at ‘computational’ situation theory, we present a new programming environment, BABY-SIT, which is based on situation theory. We then demonstrate how problems requiring formal temporal reasoning can be solved in this framework. Specifically, the Yale Shooting Problem, which is commonly regarded as a canonical problem for nonmonotonic tempo...
متن کاملEmotional interactions as a way to structure learning
Since several years, we are interested in understanding how babies learn to recognize facial expressions without having a teaching signal allowing to associate for instance an “happy face” with their own internal emotional state of happiness (Gergely and Watson, 1999). Using the cognitive system algebra (Gaussier, 2001), we showed a simple sensori-motor architecture using a classical conditioni...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- CoRR
دوره abs/1411.2861 شماره
صفحات -
تاریخ انتشار 2014